Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

Authors

  • Nathan Sprague
  • Dana H. Ballard
Abstract

We present a new algorithm, GM-Sarsa(0), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processes (MDPs). In our formulation, different sub-goals are modeled as MDPs that are coupled by the requirement that they share actions. Existing reinforcement learning algorithms address similar problem formulations by first finding optimal policies for the component MDPs and then merging these into a policy for the composite task. The problem with such methods is that policies optimized separately are not guaranteed to perform well once they are merged into a composite solution. Instead of searching for optimal policies for the component MDPs in isolation, our approach finds good policies in the context of the composite task.

This material is based upon work supported by the Department of Education under grant number P200A000306, the National Institutes of Health under grant number 5P41RR09283, and the National Science Foundation under grant number E1A-0080124.
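The idea in the abstract, component MDPs that share one action and are learned in the context of the composite task, can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the abstract does not state how the shared action is chosen, so the sketch assumes the action maximizing the sum of the modules' Q-values is selected (with epsilon-greedy exploration), and that each module applies a tabular Sarsa(0) update using the action the composite agent actually takes next. The class name `ModularSarsa`, its methods, and all parameter values are hypothetical.

```python
import random
from collections import defaultdict


class ModularSarsa:
    """Illustrative modular Sarsa(0) agent (hypothetical names throughout)."""

    def __init__(self, n_modules, actions, alpha=0.1, gamma=0.9,
                 epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha = alpha      # learning rate
        self.gamma = gamma      # discount factor
        self.epsilon = epsilon  # exploration probability
        # One tabular Q-function per component MDP, keyed by (state, action).
        self.q = [defaultdict(float) for _ in range(n_modules)]
        self.rng = random.Random(seed)

    def summed_q(self, states, action):
        # Assumption: modules are combined by summing their Q-values.
        return sum(q[(s, action)] for q, s in zip(self.q, states))

    def select_action(self, states):
        # Epsilon-greedy over the summed Q-values; the action is shared
        # by every module.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return max(self.actions, key=lambda a: self.summed_q(states, a))

    def update(self, states, action, rewards, next_states, next_action,
               done=False):
        # On-policy update: every module bootstraps from the action the
        # composite agent actually takes next, not from its own greedy
        # choice, so each module's values reflect the composite policy.
        for q, s, r, s2 in zip(self.q, states, rewards, next_states):
            target = r if done else r + self.gamma * q[(s2, next_action)]
            q[(s, action)] += self.alpha * (target - q[(s, action)])
```

In a training loop, `states`, `rewards`, and `next_states` would each hold one entry per module, and `next_action` would come from `select_action(next_states)` before `update` is called. Using the composite agent's next action in every module's target is what distinguishes this from training each module in isolation and merging the resulting policies afterwards.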

Similar resources

Multiple-Goal Reinforcement Learning with Modular Sarsa(0)

We present a new algorithm, GM-Sarsa(0), for finding approximate solutions to multiple-goal reinforcement learning problems that are modeled as composite Markov decision processes. According to our formulation different sub-goals are modeled as MDPs that are coupled by the requirement that they share actions. Existing reinforcement learning algorithms address similar problem formulations by fir...

Global Policy Construction in Modular Reinforcement Learning

We propose a modular reinforcement learning algorithm which decomposes a Markov decision process into independent modules. Each module is trained using Sarsa(λ). We introduce three algorithms for forming a global policy from the module policies, and demonstrate our results using a 2D grid world.

Car Simulation Using Reinforcement Learning

This project report presents the results of Reinforcement Learning (RL) experiments in a car simulation. Without any knowledge of the tracks in advance, the car can be trained to avoid bumping into the walls by learning from the given rewards. We have built a car simulation system in which the car can be trained and tested on the tracks with several RL algorithms, including Actor-Critic method...

Presenting a New Fuzzy SARSA Algorithm for Predicting Blood Glucose Fluctuations in Patients with Type 1 Diabetes

Background: One of the serious complications of type 1 diabetes is a sudden increase and drop in blood glucose levels, causing risks of anesthesia and coma. Thus, an important step towards the optimal control of the disease is to use intelligent methods with a low error rate and the available information in order to predict and prevent such complications. In this paper, a combined Fuzzy SARSA algorith...

Fuzzy Sarsa: An approach to linear function approximation in reinforcement learning

This paper investigates two different approaches to learning using an agent electronic marketplace as test bed. The types of learning considered in this paper include the temporal difference (TD) learning algorithm Sarsa, and two new fuzzified versions of this algorithm, FQ Sarsa and Fuzzy Sarsa. We implement the three learning algorithms in an agent test bed in order to determine their usefuln...

Publication date: 2003